feat(npi): Implement agentic approach to GCSFuse NPI by kislaykishore · Pull Request #166 · GoogleCloudPlatform/gcsfuse-tools

kislaykishore · 2026-06-15T14:24:44Z

This pull request includes several robustness improvements to the GCSFuse NPI validation and benchmarking workflow:

Implement an agent for running GCSFuse NPI.
Automatically resolve Go version from the GCSFuse repo's go.mod during Docker image builds to ensure version alignment.
Update runner agent instructions to enforce user targets inputs and check for active/suspended runs before starting.
Implement stall monitoring instructions and safety timeouts for both conformance tests and performance benchmarks (reducing orchestrator logging inactivity timeout to 5 mins).
Add detailed failure analysis guidelines for conformance testing.
Separate GCE VM and GKE Node performance runs into distinct sections and warnings in analysis and remediation guides, discouraging incorrect direct comparisons.
Explicitly treat remediation planning as advisory-only without performing automatic mutations.

…or enhancements

…artup

…image build

…uidelines

…mparisons

…skills

…om repository scripts

gemini-code-assist

Code Review

This pull request introduces modular agent skills and workflow definitions for GCSFuse NPI validation, along with a verification script to validate deliverables. Key code changes include dynamic Go version resolution in image builds, billing project configuration in GKE jobs, and an increased inactivity timeout in the orchestrator. The review feedback highlights several critical improvements: resolving a hardcoded local directory path and a fragile test count assertion in the verification script, preventing a potential AttributeError in the orchestrator when parsing GCS bucket metadata, and reverting environment-specific configurations in targets.json to generic placeholders.

…nimum threshold of 1000

…ny combination

…rts and outputs

…argets

…o use decimal MB/s instead of binary MiB/s

…om query_results.py

…e absent

… main

… runner prompt

…prompt

kislaykishore · 2026-06-15T17:43:49Z

/Gemini review

gemini-code-assist

Code Review

This pull request modularizes the GCSFuse NPI validation and benchmarking skills, dynamically resolves the Go version from go.mod during image builds, adds billing project support to GKE jobs, increases the orchestrator inactivity timeout to 4 hours, and introduces a fallback to RAM-based memory volumes in the RAID setup script. It also adds a workflow verification script. The review feedback highlights critical fixes needed in the verification script, including resolving file path and header mismatches, lowering the minimum test threshold to prevent false failures, aligning the timeout documentation with the code, and safely handling potential null values in the orchestrator's bucket metadata parsing.

…t documentation

kislaykishore

All code review feedback has been successfully resolved:

Updated the inactivity timeout documentation in benchmark-suite-execution/SKILL.md to match the code's 4-hour (14400s) timeout.
Added safe dictionary navigation (meta.get("customPlacementConfig") or {}) in npi_orchestrator.py to prevent potential AttributeErrors when GCS bucket customPlacementConfig returns None.

…s, report headers, and wildcard conformance result files

kislaykishore

All remaining code review comments on the verification script have been successfully resolved:

Wildcard Conformance JSON Support: Added a glob search fallback in verify_agent_workflow.py to dynamically search for and verify any target-specific conformance JSON results (e.g. conformance_results_*.json) if the default file is not found.
Report Headers Alignment: Aligned the expected headers in the verification script with the actual headers generated in the report (matching the ## Target Performance Results section).
Realistic Test Threshold: Lowered the minimum test count threshold in the verification script to a more realistic limit (total_tests < 100) to prevent false pipeline blocks.

kislaykishore · 2026-06-15T18:01:21Z

/gemini review

… RAM headroom

kislaykishore · 2026-06-16T08:22:23Z

/gemini review

gemini-code-assist

Code Review

This pull request modularizes the GCSFuse NPI validation and benchmarking workflow into distinct agent skills, including conformance testing, SSH management, benchmark setup, execution, analysis, and remediation. Key enhancements include a programmatic verification script, a RAM-based tmpfs fallback for hosts without local SSDs, dynamic Go version resolution from go.mod with input validation, and support for billing project annotations in GKE jobs. The review feedback suggests a safer prefix-based check in npi_gke.py to avoid false positives when detecting the billing-project mount option.

…unt options

kislaykishore · 2026-06-16T08:32:54Z

/gemini review

gemini-code-assist

Code Review

This pull request modularizes the GCSFuse NPI validation and benchmarking workflow by splitting the monolithic run-gcsfuse-npi skill into several agent-specific modular skills. It also introduces dynamic Go version resolution from go.mod in build_images.py, adds input sanitization, supports RAM buffer fallback (tmpfs) in raid0-script.sh for hosts without local SSDs, integrates billing project options in GKE job specs, and adds a programmatic workflow verification script. Feedback on the changes includes addressing a potential bug in npi_gke.py where checking for an existing billing-project option on an unsplit string can fail if multiple comma-separated options are present, and improving verify_agent_workflow.py to ensure target-specific conformance results are not skipped if a stale conformance_results.json file exists.

… both default and target-specific conformance JSONs

kislaykishore · 2026-06-16T08:43:47Z

/gemini review

gemini-code-assist

Code Review

This pull request modularizes the GCSFuse NPI validation and benchmarking workflow into distinct, agent-specific skills (SSH management, conformance testing, benchmark setup, execution, analysis, and remediation) and introduces several robust enhancements. Key updates include dynamically resolving the Go version from the GCSFuse go.mod file, automatically injecting the GCP billing project into GKE CSI mount options, adding a 500GB tmpfs RAM disk fallback in raid0-script.sh for high-RAM hosts, and introducing a programmatic verification script for validation deliverables. Feedback on the changes suggests improving the robustness of extra_flag parsing in npi_gke.py by normalizing spaces around the '=' sign to prevent duplicate billing-project options.

…vent duplicate billing-project options

kislaykishore · 2026-06-16T08:54:35Z

/gemini review

gemini-code-assist

Code Review

This pull request modularizes the GCSFuse NPI validation and benchmarking documentation into several agent-specific skills, such as conformance testing, SSH connection management, benchmark build/setup, suite execution, analysis, and remediation. Additionally, it updates the Python orchestration scripts to dynamically resolve Go versions from go.mod, handle custom placement configurations for RAPID buckets, support RAM tmpfs fallbacks when local SSDs are missing, and introduce a verification script for deliverables. The review feedback focuses on improving code quality by cleaning up redundant local imports of 're' and 'glob' and moving them to the top of the files in accordance with PEP 8 guidelines.

…es.py and verify_agent_workflow.py

kislaykishore · 2026-06-16T09:05:42Z

/gemini review

gemini-code-assist

Code Review

This pull request modularizes the GCSFuse NPI validation and benchmarking workflow by splitting the monolithic run-gcsfuse-npi skill into several focused, agent-specific skill files (conformance testing, SSH connection management, benchmark build/setup, benchmark suite execution, analysis/report generation, and remediation advisor). Additionally, it introduces a programmatic verification script (verify_agent_workflow.py), adds a RAM-based tmpfs fallback in raid0-script.sh for hosts without local SSDs, dynamically resolves the Go version from go.mod in build_images.py with strict input sanitization, and enhances Kubernetes job specification generation in npi_gke.py to support billing project configurations. No review comments were provided, so there is no feedback to address.

…sting RAID0 mounts, clarify memory buffer fallback on TPU GCE VMs, and add .gitignore for ephemeral .agents workspace

…ust conformance scripts

…tion in agent spec

…orage and bucket analysis

…able and dataset schemas

…g to analysis skill

…t make npi-conformance

kislaykishore added 12 commits June 14, 2026 21:14

feat(npi): modularize skills, add verification utility and orchestrat…

3421db4

…or enhancements

feat(npi): add gcsfuse-npi-runner agent definition file

654fd00

feat(npi): update runner agent prompt to enforce user input targets

b27467c

feat(npi): update runner agent prompt to check active run state on st…

290ec4e

…artup

feat(npi): dynamically resolve go version from GCSFuse go.mod during …

55aaa24

…image build

feat(npi): add failure analysis guidelines for conformance testing

1ebe734

docs(npi): exclude emulator_tests from conformance guide to prevent hang

9c1289f

perf(npi): reduce log inactivity check from 60 mins to 5 mins

89a2b57

feat(npi): incorporate conformance and performance stall monitoring g…

d65eeb5

…uidelines

docs(npi): separate GKE and GCE performance runs and remove direct co…

a3b7725

…mparisons

docs(npi): add cross-target comparison and advisory-only warnings to …

e5232c3

…skills

chore(npi): remove hardcoded personal VM, bucket, and path strings fr…

9bbe4d8

…om repository scripts

gemini-code-assist Bot reviewed Jun 15, 2026

View reviewed changes

Comment thread npi/verify_agent_workflow.py

Comment thread npi/npi_orchestrator.py Outdated

Comment thread npi/verify_agent_workflow.py Outdated

Comment thread npi/targets.json Outdated

Comment thread npi/targets.json Outdated

kislaykishore added 10 commits June 15, 2026 19:58

fix(npi): relax total_tests check in verify_agent_workflow.py to a mi…

cff67dc

…nimum threshold of 1000

docs(npi): update runner agent prompt to support list of targets in a…

66dda45

…ny combination

docs(npi): update skills to support multiple target verification repo…

16ee1a4

…rts and outputs

docs(npi): clarify that conformance tests are only supported on GCE t…

f251783

…argets

fix(npi): update benchmark query scripts and skill report templates t…

edd19bc

…o use decimal MB/s instead of binary MiB/s

fix(npi): remove LRO ON TLS handshake failure special-case handler fr…

37e55e1

…om query_results.py

feat(npi): support tmpfs RAM disk fallback buffer of 600GB if SSDs ar…

41b75b0

…e absent

revert(npi): restore targets.json to its original template state from…

57cf831

… main

docs(npi): allow concurrent target runs in different regions/hosts in…

a474aac

… runner prompt

revert(npi): restore Sequential Execution constraint in runner agent …

5edd292

…prompt

gemini-code-assist Bot reviewed Jun 15, 2026

View reviewed changes

Comment thread npi/verify_agent_workflow.py Outdated

Comment thread npi/verify_agent_workflow.py

Comment thread npi/verify_agent_workflow.py Outdated

Comment thread npi/.gemini/skills/benchmark-suite-execution/SKILL.md Outdated

Comment thread npi/npi_orchestrator.py Outdated

fix(npi): resolve code review feedback on metadata parsing and timeou…

2c05674

…t documentation

kislaykishore commented Jun 15, 2026

View reviewed changes

fix(npi): resolve code review comments regarding test count threshold…

80b764b

…s, report headers, and wildcard conformance result files

kislaykishore commented Jun 15, 2026

View reviewed changes

fix(npi): limit fallback tmpfs memory volume to 500GB to leave system…

6c137c7

… RAM headroom